Correlated Components Analysis - Extracting Reliable Dimensions in Multivariate Data

Authors

  • Lucas C. Parra
  • Stefan Haufe
  • Jacek Dmochowski
Abstract

How does one find data dimensions that are reliably expressed across repetitions? For example, in neuroscience one may want to identify combinations of brain signals that are reliably activated across multiple trials or subjects. For a clinical assessment with multiple ratings, one may want to identify an aggregate score that is reliably reproduced across raters. The approach proposed here, “correlated components analysis”, is to identify components that maximally correlate between repetitions (e.g. trials, subjects, raters). This can be expressed as the maximization of the ratio of between-repetition to within-repetition covariance, resulting in a generalized eigenvalue problem. We show that covariances can be computed efficiently without explicitly considering all pairs of repetitions, that the result is equivalent to multi-class linear discriminant analysis for unbiased signals, and that the approach also maximizes reliability, defined as the mean divided by the deviation across repetitions. We also extend the method to non-linear components using kernels, discuss regularization to improve numerical stability, present parametric and non-parametric tests to establish statistical significance, and provide code.
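The core idea in the abstract (maximize the ratio of between-repetition to within-repetition covariance by solving a generalized eigenvalue problem) can be illustrated with a minimal sketch. This is not the authors' published code: the function name `corrca`, the array layout `(N, T, D)` and the synthetic data are assumptions made for illustration, and regularization, kernels and significance tests are left out. The sketch obtains the sum over all pairs of repetitions from the summed signal, so no explicit loop over pairs is needed.

```python
import numpy as np

def corrca(X):
    """Hypothetical helper. X has shape (N, T, D):
    N repetitions, T samples, D dimensions."""
    N, T, D = X.shape
    # Center each repetition over its samples.
    Xc = X - X.mean(axis=1, keepdims=True)
    # Within-repetition covariance: sum of the N per-repetition covariances.
    Rw = sum(Xc[l].T @ Xc[l] for l in range(N)) / T
    # Total covariance over all pairs (l, k), including l == k,
    # computed from the sum across repetitions instead of looping over pairs.
    S = Xc.sum(axis=0)
    Rt = S.T @ S / T
    # Between-repetition covariance: all pairs with l != k.
    Rb = Rt - Rw
    # Solve Rb v = lambda Rw v by whitening with the Cholesky factor of Rw.
    # Assumes Rw is positive definite (no regularization in this sketch).
    L = np.linalg.cholesky(Rw)
    Linv = np.linalg.inv(L)
    M = Linv @ Rb @ Linv.T
    M = (M + M.T) / 2  # symmetrize against round-off
    evals, U = np.linalg.eigh(M)
    order = np.argsort(evals)[::-1]  # strongest component first
    V = Linv.T @ U[:, order]         # projection vectors as columns
    isc = evals[order] / (N - 1)     # correlation between repetitions
    return V, isc

# Synthetic example: one signal shared by all repetitions, mixed into
# three dimensions, plus independent noise in each repetition.
rng = np.random.default_rng(0)
N, T, D = 5, 500, 3
shared = rng.standard_normal(T)
mixing = np.array([1.0, 0.5, 0.0])
X = shared[None, :, None] * mixing[None, None, :]
X = X + 0.5 * rng.standard_normal((N, T, D))

V, isc = corrca(X)
print("between-repetition correlation per component:", isc)
```

In this toy setting the first component should recover the shared signal and show a high correlation between repetitions, while the remaining components stay near zero. Dividing the eigenvalues by `N - 1` turns the covariance ratio into an average correlation over pairs of repetitions.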

Similar resources

Principal Component Analysis, A Powerful Unsupervised Learning Technique

Data mining is a collection of analytical techniques to uncover new trends and patterns in massive databases. These data mining techniques stress visualization to thoroughly study the structure of data and to check the validity of the statistical model fit which leads to proactive decision making. Principal component analysis (PCA) is one of the unsupervised data mining tools used to reduce dim...

Relationship between Yield and its Component in Soybean Genotypes (Glycine Max L.) using Multivariate Statistical Methods

18 soybean genotypes were examined to investigate the relationships between some principal attributions of morphology with seed yield per soybean, by Random Complete Block Design (RCBD) study. This study was also carried out three replicates to gain reliable results. The results of variance analysis indicated that, there were significance differences among all soybean genotypes. Moreover, the r...

Modelling of Correlated Ordinal Responses, by Using Multivariate Skew Probit with Different Types of Variance Covariance Structures

In this paper, a multivariate fundamental skew probit (MFSP) model is used to model correlated ordinal responses which are constructed from the multivariate fundamental skew normal (MFSN) distribution originate to the greater flexibility of MFSN. To achieve an appropriate VC structure for reaching reliable statistical inferences, many types of variance covariance (VC) structures are considered ...

Multivariate Statistical Analysis Decision-making Hybrid Method for Road Traffic Safety Evaluation in Iran

Obviously, improving the road safety and the efficient allocation of limited resources to the provinces according to their ranking should be done. This paper presents a hybrid method of multivariate statistical analysis-decision making to evaluate Iran road traffic safety. In order to solve the problems of road traffic safety, a macroscopic evaluation and traffic safety level classification in ...

Attachment styles and emotional intelligence components: the predictors of health dimensions

Health, as one of the most important sources of comfort in life, is the complete physical, mental and social well-being, while there are dynamic mutual relationships among the three components. This study was aimed to investigate the role of attachment styles and emotional intelligence components in the prediction of health dimensions. The statistical population was consisted 160 parents who pa...

Journal title:
  • CoRR

Volume abs/1801.08881

Publication date 2018